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Abstract. We derive an exact expression for the probability density function of the cascade 
t^ ^ size (total progeny) in a continuous state branching process when the generations are Gamma 

distributed. The distribution has application in the modelling of cascade processes such as 
landslides and electrical network failures. 

p I I 1. Continuous state branching processes 

1 -Q . 1.1. General Theory. We may define a continuous state branching process in the following 

C^ I way, see [H [2] for further details. Without loss of generality we let the size of the zeroth 

generation be Xq = 1. The size of the first generation is then drawn from some distribution 
G with support [0, oo], so Xi ~ G. We now write G*" for the distribution of the sum of 
n independent copies of Xi. We extend this definition to non integral n as follows; if g is 
the Laplace transform of the density function for the distribution G, then (^)" is the Laplace 
transform of the density function for G*". The size of the nth generation is then a random 
variable with the following distribution: 






(~^ ', This defines a branching process. 



Xn '^ <~J 



1.2. A Gamma Branching Process. The Gamma distribution T{k,9) has the probability 
density function 



X e 



. <^.' If X ~ r(A;, 9) then E{X) = k9, and Var{X) = kO^. The distribution has the following property 

which holds for all n G M^: 

T{k,ey'' = r{nk,e). 

We make use of this property in setting up the following branching process. Let Xq = 1 be the 
size of the zeroth generation, and let: 

Xi~r(2,p). 

We have made the choice k = 2 here but the following analysis may be generalised to arbitrary 
k. The size of the nth generation is distributed as: 

Xn ~ r(2,p)*^"-l = Ti2Xn-l,p). 

The total cascade size in an infinite system: 

oo 

Z = Y,Xk 

k=0 

may be infinite. In what follows we will compute the probability of this event: ¥{Z = oo}. 
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2. The cascade distribution 

It is possible to derive the exact probability density function for the total size, Z of the cascade 
by taking the continuum limit of a discrete branching process. 

2.1. Negative binomial approximation to the Gamma distribution. We begin by noting 
that the negative binomial distribution, which has probability mass function: 

b{n,r,q)= \ / 1 - gfg" 
nil [r) 

provides an arbitrarily close discrete approximation to the gamma distribution for appropriate 
choice of the parameters r and q. The approximation is set up in the following way. We divide 
[0, oo] into a discrete lattice of constant spacing 6, and let Xs be a discrete random variable 
which approximates X ~ T{k,9). Let Xs have the probability mass function: 

F{Xs = n6) = b{n,r,q). 

The mean and variance of Xs are then: 



E{Xs) 



Var{Xs) 



6pr 
1 — p 
6'^pr 



{1-pY 

By requiring that these match the mean and variance of X, we find that: 

kO 

r = 

9-6 

9-6 



With these choices of r and q, in the limit (5 — )■ the discrete distribution converges to r(A;, 9) 
in the following sense: 



1. r, ,., ., „^ ., „^■, X^ ^6 



hm-b[lx/6\,r{k,9),q{k,9)] 



The advantage of thinking of the Gamma distribution as the limit of a negative binomial lies in 
the fact that the cascade distributions may be calculated explicitly in the discrete setting. We 
may extend the idea of non-integral convolution to the negative binomial distribution by making 
use of the following property. If y ~ NB{r,q) is a negative binomial variable, the the sum of 
m independent copies of Y has distribution NB{mr,p). Setting 5 = — we may think of our 
negative binomial approximation to the gamma distribution as the sum of m negative binomial 
variables, Yi S {0, (5, 25, . . .}, each with distribution: 

Yi^NB(-,q). 

We will refer to this as the atomic distribution A. We can approximate non integral convolutions 
of the Gamma distribution as integral convolutions of the atomic distribution as follows: T*-^''' ~ 

2.2. Cascade distribution for the discrete state branching process. From here on we 
set k = 2 and 9 = p. The particular values of r and q in the atomic distribution required so that 
the negative binomial approximates the gamma distribution are: 

p — 6 

q = 

p 
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Let Zs{m) be the total cascade size starting from m individuals where the number of offspring 
produced by each individual is distributed according to the atomic distribution. We note that 
if (5 = 1/m then Zs{m) ~ Z. Because the the numbers of offspring produced by each individual 
are independent then Zs{m) has the same distribution as the sum of m independent copies of 
Zs{l). Let Y be an ^-distributed variate then: 

(1) Zs{l) = 1 + Zs{Y) 

If H{s) and F(s) are the probability generating functions for Zs{l) and Y, then from equation 
([T|) we have 

= sE[E(s^*(^) I Y)] 

= sn{F{s)f] 

= sFiH{s)). 
From the negative binomial mass function we have that: 



oo . 



n=0 



We are interested in the probability generating function of Zs{m), which is just H"^{s). The 
coefficient of s" in this function may be determined using the Lagrange inversion formula: 



777 

n 

m r(n(l + r*) — m) 



*\''' n(^*\n—m 



n r(nr*)r(n — m + 1) 
^{Zs{m) = n] 



\q 



We now have the probability mass function for the cascade size in the discrete branching process 
which approximates the continuum process that we are interested in. 

2.3. The continuum limit. We obtain the continuum cascade density function, which we will 
call g{x), by setting n = x/5 and m = 1/5 and then taking the limit 5 — > 0: 

(2) gix) = lim -^{Zsim) = n} 

5-s-O 

^ (^^l)2.-lg-(|+21np)x+i 



xr(2x) 

The asymptotic properties of ^(a;) may be determined by making use of Stirling's approximation: 

T{z + 1) ~ \/2^ (f )^ The result is: 

i-2+ln2 -fi^+21n2p)x 

S{x) '■ 7z—^ 3 as X ^ cx) 



From this we see that the distribution is asymptotically a pure power law when p = o 



2- 
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2.4. Moments in the subcritical case. Provided p < ^, the distribution g{x) is normahsed 
and its moments are defined. It is useful to have explicit expressions for the mean and variance 
of g{x) in this case. We first compute the mean and variance of the distribution of Zs{l) using 
the generating function relationship: H{s) = sF{H{s)), which after differentiation reveals that: 

F{H{s)) 



H'{s) 
H"{s) 



1 - sF\H{s)) 

2H'{s)F'{H{s)) + s{H'{s)fF"{H{s)) 



1 - sF'{s) 

Using the expression for F{s), together with the fact that when p < 2 H{1) = F{1) = 1, we 
find that: 

E{Zsil)) = 5H'{1) 



nZsil?) - nZsil)? = 5\H"{1) + H\l) - H'{lf) 



l-2p 
2(5p2 



(1 - 2p)3 
Since, in the limit (5 — t- 0, Z has then same distribution as the sum of ^ copies of ^^(1), then: 

E(Z) ^ 



l-2p 
2 



E{Z^) - E{Zf - ^^' 



(1 - 2p)3 

It is worth noting that these expressions would have been difficult to obtain by direct integration 
over g{x). 

3. Probability of the event {Z < 00} 

Numerical integration of the exact distribution ^ reveals that it is not normalized when 
p > 2- The total probability weight is equal to P{Z < 00} which is less than one in the 
supercritical case. We will now show that: 

/oo 
g{x)dx = e^^P^ 

where: 

x{p) = f 2iy_i I 




2p 

and W-i is one of the two real branches of the Lambert W function, the other being Wq. We 
may derive this result by mapping the discrete branching process on to a random walk, and then 
constructing a Martingale to which the optional sampling theorem may be applied. 

We must show first that the total cascade size in a discrete branching process has the same 
distribution as a first passage time for a random walk. We consider the process with offspring 
distribution A (the atomic distribution) and suppose that Xq is the size of the zeroth generation. 
We identify Xq with the initial position of the walker. The size, Xi, of the next generation is then 
the sum of Xq A-distributed random variables. Now, suppose that V ^ A and define Q :=V —1. 
We write the distribution of Q as A~ , and note that Xi has the same distribution as the position 
of a walker after Xq, ^"-distributed steps. Note that since A~ has support {—1, 0, 1, 2, . . .} then 
if the walker does reach the origin then it will do so at the Xo-th step. The sizes, {X2, X3, X4, . . .} 
of subsequent generations may be viewed as the positions of the same walker after, respectively, 
{Xi,X2,X3, . . .} steps. The cascade ends when a generation has zero size, which occurs when 
the walker reaches the origin, and the total cascade size Xq + Xi + X2 + . . . is the total number 
of steps taken by the walker. The cascade size therefore has the same distribution as the time 
of first intersection of the A~ distributed walker with the origin. 
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It remains to compute the probability that walker will return to the origin. We do this using 
martingales. The aim is to construct a martingale from the A~ walk of the form: 

where Sn is the position of the walker after n steps, and then to apply the optional sampling 
theorem, which states that under certain conditions (which will be satisfied for us) E(Afr) = Mq 
when r is a stopping time measurable w.r.t. the information contained in the walk up to the 
nth step. The value of a which makes M„ a martingale satisfies the equation: K[a'^) = 1, which 
has the explicit form: 

1 / 1 . . 

1, 



a \1 — q*a 
or, written in terms of p and 5: 

2pS 2pS 

Apart from the trivial solution ai = 1, when p > \ this equation has another solution a2 < 1 
which approaches 1~ as 5 — )■ 0. Writing a = 1 — e and defining x = e/6 we find that the solutions 
to dSI) converge, as (5 — )• 0, to the solutions of 

X = 21n(l +px), 

the appropriate one being: 

To approximate the Gamma branching process we must start the atomic branching process with 
Xq = 1/6. Optional sampling tells us that if T is the first step at which the walk reaches the 
origin or oo then 



E(a^ ) = F{Zs < oo} = a 



Taking the limit 5 — )■ we find that: 



which reproduces our claim ([4]) 



F{Z < oo} = lim ¥{Zs < oo} 
5—5-0 

= lim(l — 6x{p))'i 
5->o 

= g-a;(p) 



4. Concluding comment 



Branching processes are useful in the modelling of cascading failures, amongst many other 
applications. In the simplest case, the generations of the cascade are integer random variables, 
but this is not always the most appropriate model. The exact expression derived here for 
the probability distribution of cascade sizes the branching process with Gamma distributed 
generations is therefore likely to be useful tool in applications. 
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